AITopics | Burlington

The Exponentially Weighted Signature

The signature is a canonical representation of a multidimensional path over an interval. However, it treats all historical information uniformly, offering no intrinsic mechanism for contextualising the relevance of the past. To address this, we introduce the Exponentially Weighted Signature (EWS), generalising the Exponentially Fading Memory (EFM) signature from diagonal to general bounded linear operators. These operators enable cross-channel coupling at the level of temporal weighting together with richer memory dynamics including oscillatory, growth, and regime-dependent behaviour, while preserving the algebraic strengths of the classical signature. We show that the EWS is the unique solution to a linear controlled differential equation on the tensor algebra, and that it generalises both state-space models and the Laplace and Fourier transforms of the path. The group-like structure of the EWS enables efficient computation and makes the framework amenable to gradient-based learning, with the full semigroup action parametrised by and learned through its generator. We use this framework to empirically demonstrate the expressivity gap between the EWS and both the signature and EFM on two SDE-based regression tasks.

artificial intelligence, machine learning, signature, (17 more...)

arXiv.org Machine Learning

2603.19198

Country:

Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
North America > United States > New York (0.04)
(2 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Towards Anytime-Valid Statistical Watermarking

Huang, Baihe, Xu, Eric, Ramchandran, Kannan, Jiao, Jiantao, Jordan, Michael I.

arXiv.org Machine Learning, Feb-20-2026

The proliferation of Large Language Models (LLMs) necessitates efficient mechanisms to distinguish machine-generated content from human text. While statistical watermarking has emerged as a promising solution, existing methods suffer from two critical limitations: the lack of a principled approach for selecting sampling distributions and the reliance on fixed-horizon hypothesis testing, which precludes valid early stopping. In this paper, we bridge this gap by developing the first e-value-based watermarking framework, Anchored E-Watermarking, that unifies optimal sampling with anytime-valid inference. Unlike traditional approaches where optional stopping invalidates Type-I error guarantees, our framework enables valid, anytime-inference by constructing a test supermartingale for the detection process. By leveraging an anchor distribution to approximate the target model, we characterize the optimal e-value with respect to the worst-case log-growth rate and derive the optimal expected stopping time. Our theoretical claims are substantiated by simulations and evaluations on established benchmarks, showing that our framework can significantly enhance sample efficiency, reducing the average token budget required for detection by 13-15% relative to state-of-the-art baselines.

arxiv preprint arxiv, large language model, machine learning, (17 more...)

arXiv.org Machine Learning

2602.17608

Country:

Asia > Middle East > Jordan (0.41)
North America > United States > California > Alameda County > Berkeley (0.04)
North America > United States > Massachusetts > Middlesex County > Burlington (0.04)
Europe > United Kingdom > Scotland > City of Edinburgh > Edinburgh (0.04)

Genre: Research Report > Promising Solution (0.48)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs

Neural Information Processing Systems, Feb-16-2026, 12:57:30 GMT

Partial Differential Equations (PDEs) is still lacking. This study introduces PINNacle, a benchmarking tool designed to fill this gap.

artificial intelligence, deep learning, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Burlington (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Scaling transformer neural networks for skillful and reliable medium-range weather forecasting

Tung Nguyen

Neural Information Processing Systems, Feb-16-2026, 03:16:07 GMT

Recently, data-driven approaches for weather forecasting based on deep learning have shown great promise, achieving accuracies that are competitive with operational systems. However, those methods often employ complex, customized architectures without sufficient ablation analysis, making it difficult to understand what truly contributes to their success.

artificial intelligence, machine learning, modeling & simulation, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Burlington (0.04)
North America > United States > Alaska (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry:

Energy (0.68)
Government (0.46)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Stochastic Proximal Langevin Algorithm: Potential Splitting and Nonasymptotic Rates

Adil Salim, Dmitry Kovalev, Peter Richtárik

Neural Information Processing Systems, Feb-12-2026, 11:47:59 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, operator, proximity operator, (10 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Middlesex County > Burlington (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Communication-Efficient Distributed Learning via Lazily Aggregated Quantized Gradients

Jun Sun, Tianyi Chen, Georgios Giannakis, Zaiyue Yang

Neural Information Processing Systems, Feb-12-2026, 03:12:57 GMT

algorithm, communication, gradient, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > Quebec > Montreal (0.05)
North America > United States > California > Los Angeles County > Long Beach (0.05)
(13 more...)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.31)

Unlocking Fairness: a Trade-off Revisited

Neural Information Processing Systems, Feb-11-2026, 21:49:12 GMT

artificial intelligence, machine learning, unlocking fairness, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.07)
North America > United States > Massachusetts > Middlesex County > Burlington (0.05)
North America > United States > District of Columbia > Washington (0.05)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

91cff01af640a24e7f9f7a5ab407889f-Supplemental.pdf

Neural Information Processing Systems, Feb-9-2026, 08:45:48 GMT

algorithm, gradient flow, wasserstein gradient flow, (14 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Middlesex County > Burlington (0.04)
North America > Canada (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

91cff01af640a24e7f9f7a5ab407889f-Paper.pdf

Neural Information Processing Systems, Feb-9-2026, 08:45:40 GMT

algorithm, gradient flow, wasserstein gradient flow, (13 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Middlesex County > Burlington (0.04)
North America > Canada (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Statistical Inference for Manifold Similarity and Alignability across Noisy High-Dimensional Datasets

Chen, Hongrui, Ma, Rong

arXiv.org Machine Learning, Nov-27-2025

The rapid growth of high-dimensional datasets across various scientific domains has created a pressing need for new statistical methods to compare distributions supported on their underlying structures. Assessing similarity between datasets whose samples lie on low-dimensional manifolds requires robust techniques capable of separating meaningful signal from noise. We propose a principled framework for statistical inference of similarity and alignment between distributions supported on manifolds underlying high-dimensional datasets in the presence of heterogeneous noise. The key idea is to link the low-rank structure of observed data matrices to their underlying manifold geometry. By analyzing the spectrum of the sample covariance under a manifold signal-plus-noise model, we develop a scale-invariant distance measure between datasets based on their principal variance structures. We further introduce a consistent estimator for this distance and a statistical test for manifold alignability, and establish their asymptotic properties using random matrix theory. The proposed framework accommodates heterogeneous noise across datasets and offers an efficient, theoretically grounded approach for comparing high-dimensional datasets with low-dimensional manifold structures. Through extensive simulations and analyses of multi-sample single-cell datasets, we demonstrate that our method achieves superior robustness and statistical power compared with existing approaches.

dataset, eigenvalue, estimator, (15 more...)

arXiv.org Machine Learning

2511.21074

Country: North America > United States > Massachusetts > Middlesex County > Burlington (0.04)

Genre: Research Report (0.50)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.45)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.45)
